Noise, errors and information in quantum amplification 
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Abstract 



We analyze and compare the characterization of a quantum device in terms 
of noise, transmitted bit-error-rate (BER) and mutual information, showing how 
the noise description is meaningful only for Gaussian channels. After reviewing 
the description of a quantum communication channel, we study the insertion 
of an amplifier. We focus attention on the case of direct detection, where the 
linear amplifier has a 3 decibels noise figure, which is usually considered an 
unsurpassable limit, referred to as the standard quantum limit (SQL). Both 
noise and BER could be reduced using an ideal amplifier, which is feasible in 
■ principle. However, just a reduction of noise beyond the SQL does not generally 

correspond to an improvement of the BER or of the mutual information. This is 
the case of a laser amplifier, where saturation can greatly reduce the noise figure, 
although there is no corresponding improvement of the BER. Such mechanism 
is illustrated on the basis of Monte Carlo simulations. 

5 ! 1 Introduction 

03 
2 . 

In order to exploit the bandwidth available in the optical domain the evolution of optical 
communications urges conversion of hybrid electro-optical devices towards all-optical ones. 
For long distance communications the losses along the optical fiber decrease the transmitted 
power and introduce communication errors, thus being the crucial limitation to the 
development of this kind of technology. On the other hand, an amplifier along the line 
restores the power level, but introduces noise of quantum origin. For direct detection, which 
would allow to achieve the ultimate channel capacity , an optical phase insensitive amplifier 
used in the linear regime introduces 3 decibels of noise. Such noise has long been considered 
as an unsurpassable limit - the so called "standard quantum limit" (SQL) — which, 
however is just a peculiarity of the linear phase insensitive amplifier (PIA) j|. 

In order to recover the transparency of an optical network one needs to achieve a perfect 
amplification that, by itself, does not introduce any additional disturbance. Indeed, as 
suggested by Yuen ||, it is possible in principle to achieve such ideal amplification also 
for direct detection, although there is still no actual device that can attain it in practice. 
With this aim one could try to improve the performance of an existing amplifier by driving 
it far from the linear regime, and going beyond the SQL. In Ref. || it has been shown that 
this is indeed possible for a saturable laser amplifier, where in an intermediate-saturating 
regime one can accomplish noise suppression with still sizeable gains. The objective of Ref. 
H was originally to prove that the SQL can be actually breached in a concrete case. In a 
following debate || [| it has been pointed out that the noise is not the significant quantity for 
evaluating the goodness of an amplifier, and one should better resort to the original problem 
of the transmitted bit-error-rate (BER). However, Ref. || still considered the PIA as the 
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reference standard quantum amplifier, whereas, as clarified in Ref. 0, the ideal photon 
number amplifier (PNA) of Yuen indeed could greatly improve the transmitted BER of the 
PIA. 

In this paper we reconsider the problem of characterizing the quality of a low-noise 
amplifier from the beginning. We clarify that the quantity that unambiguously determines the 
behavior of a device inserted in a communication channel is the so called "mutual information" 
between the input and the output. For a binary channel, the BER is essentially equivalent 
to the mutual information for small error probabilities, whereas it remains an ambiguous 
characterization for any pathological situation. On the other hand, we will also show that the 
characterization in terms of noise is still meaningful for Gaussian communications channels. 

As the noise characterization || is not sufficient to establish whether a saturable amplifier 
can perform better than a linear one, here we present a careful analysis of a laser amplifier 
in a quantum regime, and compare it with an ideal PIA. We illustrate the mechanism 
that underlies a noise reduction under saturation, and explain why such reduction does not 
correspond to any improvement of the BER. We will show that this is typical of the saturation 
mechanism, whereas, in general, a noise reduction may correspond to an improvement of the 
BER, and, indeed, for an ideal PNA such an improvement can be actually achieved. 

The outline of the paper is the following. In Sec. [2] we give a brief introduction to 
the basic concepts of information theory that are needed for a complete characterization of 
the operation of a quantum amplifier. Such concepts are then specialized to the case of a 
quantum communication channel in Sec. ||, where a complete description of the transmission 
of quantum signals and the characterization of a quantum amplifier are given. In Sec. || 
the linear Gaussian channel is discussed. This channel is of particular interest because it 
is the only case where a quantum amplifier can be equivalently characterized in terms of 
mutual information and in terms of noise. In view of the subsequent comparison between 
the performances of a linear and a saturable amplifier, in Sec. || the conventional linear PIA 
is reviewed and the SQL is derived. In Sec. |6] the laser saturable amplifier is presented and 
analyzed on the basis of the Fokker-Planck equation derived by Haake and Lewenstein . In 
Sec. 0, after presenting some numerical tests of the theory and of the Monte Carlo simulation 
method, we compare the performance of the saturable amplifier to the one of the linear PIA. 
The conclusions are then given in Sec. [||. 

2 Basic concepts of information theory 

A communication channel between a transmitter and a receiver can be viewed as composed 
of three basic constituents: an encoder, a transmission line, and a decoder. The encoder 
transforms the input of the channel — i.e. the message to be transmitted — into a proper set of 
physical signals that are impinged into the transmission line. In this schematic representation 
the transmission line includes all possible physical processes that affect the signal that carries 
the message from the sender to the receiver, including all noise sources and any kind of 
device (e.g. amplifiers) inserted along the line. The decoder applies a set of operations on 
the received signal, like measurements and data processing of the results, and then gives the 
reconstructed message at the output of the channel. If no transformation modifies the signals 
along the line, then the encoded message can be perfectly reconstructed, otherwise errors can 
arise at the decoder, and the intrinsic potential of the channel decreases. 

The significant quantity that measures the efficiency of a communication channel is the 
mutual information. Let us define it first in the case of a classical discrete channel, and then 
generalize it later to the continuous and quantum cases. 

By definition, a discrete channel transmits only symbols belonging to a numerable set. 
Suppose that the input of the channel is a symbol from the set (or alphabet) {xj, j = 1, J} 
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of J elements with a priori probabilities {pj,j = 1, J} (J2i=iPi = 1)> whereas the output 
is a symbol from the set {yk, k = 1, K} of if elements (the two sets need not be equal). 
The two sets are linked by specifying the conditional probabilities Qk\j that the output is y^ 
given the symbol Xj at the input. The mutual information is given by 

m n=EE ^ io g2 =^g- « 

i=i fc=i 2^iPi^k\i 

and quantifies the degree of knowledge (in bits) that the output random variable Y gives 
about the input random variable X (by the capital letter X = {xj,pj,j = 1, •••,«/} we 
denote the random variable, namely the set of symbols {xi} along with the corresponding 
probability distribution {pi}). The mutual information can also be written as follows 

I(X;Y)=H(X)-H(X\Y) , (2) 

namely as the difference between the entropy at the input 

J 

H(X) = -^2 Pj log 2Pj (3) 

3=1 

and the conditional entropy at the output 

K J 

H(X\Y) = -Y / q k J2 P 3\k l°g 2 P 3\k , (4) 
k=l j=i 

where qu = J2iPiQk\i 1S the unconditioned probability of the output symbol y^ and 
Pj\k = PjQk\j/Qk is the conditioned probability that symbol Xj was transmitted given that 
symbol is received. In the limiting case X = Y that input and output alphabets coincide 
and the channel is noiseless (Qj\k = ^jk), one has 

I(X;X)=H(X), (5) 

namely the information transmitted through a noiseless channel is just the entropy of the 
input alphabet. In the opposite limiting case that the input and output random variables 
are completely uncorrelated {Qj\k = Qj for all k) the mutual information vanishes. 

The definition of mutual information (|l]) can be straightforwardly generalized to the case 
of a continuous set of symbols x £ X and y £ y, with a priori probability density p(x) and 
conditional probability density Q(y\x). Here the sums in ([]]) replaced by integrals as follows 

I(X; Y)= I dxf dy p{x)Q{y\x) log 2 . (6) 

Jx Jy J dz p(z)Q{y\z) 

Let us now consider the case of a binary channel, where two different symbols "0" and 
"1" (which make one "bit") are transmitted with equal a priori probabilities Po = Pi = 1/2. 
Usually a binary channel is characterized by the probability of making errors at the decoder, 
called "bit error rate" (BER), and defined as follows 

B = l(Qo\i + Qi\o) ■ (7) 

In Eq. (^) Qi\o is named "false alarm probability" (the probability of detecting "1" when 
the signal "0" was transmitted), whereas Qx\\ = 1 — Qoll i s called "detection probability". 
Among the possible choices of sets of independent probabilities, in the following we adopt 
the couple {Qo|iiQi|o}> m terms of which all quantities of interest (e.g. BER B and mutual 
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information /) can be expressed. The mutual information of the binary channel takes the 
form 



1 \n n M 2(1-^0) 2Qqh 
+ Qi\o^g 2 - — >—— + (l-Q | 1 )log 2 - ' 

1 — V0|1 + V1|0 i — V0|1 + V1|0 



-+ 



(8) 



In the binary channel the BER gives the same characterization as the mutual information in 
the limiting case of small error probabilities (Qi|o>Qo|l ^ !)• Here, I can be expanded at 
first order in Qi\q and Qo|l> an( i one nas 



B < 1 =^ 1^1- B . 



0) 



Notice, however, that the BER and the mutual information generally do not give the same 
description of the channel, as one can have channels with the same BER but having different 
mutual information, and vice versa. For instance, consider the case B = corresponding to 
Qo\i = Qi\o = (namely no errors are made in the identification of the binary symbol at the 
output). As mentioned before this means that the information is maximum, i.e. I(X; Y) = 1. 
On the other hand, for B = 1 one has Q |i = Qllo = 1) namely the output symbol is always 
interpreted in the wrong way: in this case, however, the mutual information is still unit, and, 
in fact, it is possible to reconstruct the transmitted message without ambiguity by swapping 
the symbols "0" and "1" at the output. Moreover, one can have different values of the mutual 
information that correspond to the same BER, just by varying the probabilities of error and 
keeping their sum fixed. 

As shown in the above examples, the mutual information gives a more complete 
description of the communication channel than the BER, and hence it should be adopted 
as the right quantity to characterize the channel. In the rest of this paper, however, we will 
use the BER, because in our case the probabilities of error are so small that BER is equivalent 
to the mutual information via Eq. (S). 



3 The quantum communication channel 

In this section we specialize the above concepts to the quantum communication channel, 
where the information is encoded on quantum states. In the first two subsections we give 
a characterization of the channel in terms of mutual information, and in terms of gain and 



noise figure of the devices inserted along the line. Finally, in Subsec. 3.2 we review the 
description of the dynamical evolution of the quantum state along the transmission line, with 
the insertion of both amplification and losses. 

3.1 Mutual information in the quantum channel 

In a quantum channel the symbols to be transmitted are encoded into density operators p x 
on the Hilbert space Ti of the dynamical system that supports the communication. The 
alphabet (which can be either discrete or continuous) is distributed according to an a priori 
probability density p(x), which also determines the mixture p given by 

p = J dx p(x)p x . (10) 

The set of states p x and the probability density p(x) are globally referred to as "encoding". 
The transmission line includes all dynamical transformations of the encoded states between 
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the encoder and the decoder. In the Schrodinger picture these are described by a completely 
positive (CP) map p — > pt = E t (p) acting on the density matrix p that carries the 
information through the channel. The last step in the communication channel is the decoding 
operation, where the recognition of the transmitted symbol is established as the result of a 
quantum measurement. In the general case this is conveniently described by means of a 
probability operator measure|| |j| (POM), which extends the notion of orthogonal-projector- 
valued measure associated to customary quantum observables. The POM dH(y) provides the 
probability distribution p[p](y) dy of the random outcome y of the measurement according 
to the rule 

P [p}(y)dy = TT[pdU(y)}. (11) 

In order to have a genuine probability dP[p](y), a POM must satisfy the positivity condition 

dU(y) > (12) 

as a consequence of the positivity of the density operator p. Normalization of the probability 
p(y) is guaranteed by the completeness relation 

J dU(y) = 1 . (13) 

Each measurement apparatus is described by a POM. Actually, the POM comes from a 
customary orthogonal projection-valued measure that includes also the degrees of freedom of 
the apparatus, which are partially traced out in order to give a description of the system alone. 
Notice that the correspondence between measuring apparatus and POM's is not one-to-one, 
since different detectors can be described by the same POM. In a quantum communication 
channel the POM specifies the conditional probabilities in the mutual information (||), namely 

Q(y\x) dy = Tt[E t ( Px ) dU(y)} . (14) 

The value of the mutual information depends on all the three steps of the communication 
channel: encoding, transmission and decoding. The maximum of I(X, Y) over all possible 
a priori probability densities p(x) for a given constraint (typically the average power along 
the line) and for an ideal channel (namely E% = 1 is the identical transformation) is called 
"capacity" C of the channel, whereas the global maximum obtained optimizing also over both 
encoding and decoding is called "ultimate quantum capacity". For fixed average power n, 
using the Holevo-Ozawa-Yuen information bound iQ it can be proved that the ultimate 
quantum capacity for a quantum-optical channel is achieved by direct detection and number- 
state encoding, with p being the thermal state with photon number n. 



3.2 Gain and noise in amplification 

Let us now consider the effects of the insertion of an amplifier in a transmission line. The 
performances of a quantum amplifier are described in terms of the gain and the disturbance 
caused by the device to the transmitted signal. As already discussed, the best quantity to 
characterize the disturbance is the loss of mutual information with respect to the case of ideal 
transmission. However, although the noise figure is significant only for Gaussian channels (see 
Sec. |j), this is the quantity that is customarily used to characterize the disturbance of the 
amplifier. 

Both gain and noise figure generally depend on the overall configuration of the channel, 
i.e. on the particular encoded states and on the kind of detection at the decoder. The gain 
is defined as the ratio between the output and the input signals as follows 

G = ^ , (15) 
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where the signals Si ntOU t are defined as the difference between expectation values 

Sin,out = Tr{0 Pin,out} ~ Tl{0 Vin i0ut }, (16) 

where p ou t = E t (pi n ), v ou t = E t (vi n ), E t is the CP map of the amplifier from the input to the 
output, O is the detected operator and v is a reference state. For optical amplifiers at the 
input of the channel the state V{ n is usually the vacuum, but it is generally transformed into 
a non- vacuum state v ou t as it evolves through the amplifier. An amplifier is called "linear" 
when the gain does not depend on the input state pi n . Notice that, as the signal depends 
on the detection scheme defined by the observable O, in principle the linear character of a 
device depends on the kind of detection at the output. 

The noise figure quantifies the degradation of the signal-to-noise ratio (SNR) from the 
input to the output, and is defined as 

R = SNR in / SNR out . (17) 

In Eq. ( p"7| ) the signal-to-noise ratio is given by 

SNR = S 2 /N , (18) 

where N is the quantum noise (A0 2 ) p = Tr(p0 2 ) - [Tr(pO)] 2 for detected observable O 
averaged over the alphabet with a priori probability distribution p(x), i.e. 

N = J dxp(x)(A0 2 x ) Px . (19) 

Notice that the definitions of both signal S and noise N can be easily generalized to the case 
of a detector described by a non orthogonal POM dH, just replacing the ensemble averages 
by 



(y% = Tr 



p J dU(y) y k 



(20) 



When characterizing the quality of an amplifier, usually one resorts to determining the 
noise added by the amplifier along the line. However, a quantitative characterization should 
be based on the change of mutual information due to the amplifier insertion. For this 
reason, we define as "ideal" an amplifier that does not degrade an optimized channel, namely 
that leaves the information invariant in a channel where the information has been already 
maximized over all possible encodings and decodings for a given constraint. As a rescaling of 
the probability distribution Q(y\x) — ► Q(G~ 1 y\x) leaves the mutual information in Eq. @ 
invariant, an amplifier that rescales the probability by the gain G — hence maps the observable 
O to GO — is ideal (the pathological situation of bounded spectrum is not considered, as in 
this case the concept itself of amplification is meaningless) . For such an ideal amplifier one has 
unit noise figure R = 1, independently of the input state. It is clear that for a non optimized 
channel, an amplifier can increase the mutual information, and, actually, this is its purpose. 
In fact, by rescaling the detection spectrum the amplifier expands the effective dimension of 
the Hilbert space that supports the relevant part of the transmitted information. The loss of 
information in a non optimized channel can be caused either by an irreversible dynamics along 
the line (this is the case of the loss), and/or by a mismatch between coding and detection, or 
by low quantum efficiency of the detector. In the first case it is clear that the amplifier must 
be inserted before the leak of information, namely as a preamplifier, whereas in the second 
case it can be inserted just before the detector, the quantum efficiency representing essentially 
an irreversible evolution inside the detector. Notice that the configuration of the channel with 
the amplifier before the loss may violate the physical constraint of limited power, and hence 
the best configuration is a distributed gain along the line. 
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In the case of a binary channel, with symbols "0" and "1" corresponding to the input 
states po = v and p (v being the vacuum), the gain is given by Eq. (15), and Eq. fll9| ) 
rewrites 



N = - (<AO 2 ) P0 + (AO 2 ),) . (21) 

In Sec. U we will show the equivalence between BER and noise figure in the characterization 
of a Gaussian binary channel, and in Subsec. the difference between the two quantities 
will be revealed in the cases of linear and saturable amplifiers. The characterization of an 
amplifier given in this section is also suited to describe an attenuator (i.e. a "deamplifier" ) or 
the loss itself, where now G < 1, whereas, generally, for an amplifier one strictly has G > 1. 



3.3 CP maps, master equation and Fokker - Planck equation. 

In this subsection we will briefly introduce the quantum treatment of the evolution equations 
for a signal state through a transmission line in the presence of losses or active devices (e.g. 
amplifiers). Both losses and amplifiers are treated as open systems [Q. In an amplifier the 
mode of radiation that undergoes amplification — the "signal" mode — resonantly interacts 
with other modes of radiation (parametric amplifier) or with matter degrees of freedom 
(active medium amplifier). The necessary energy for amplification is provided by "pump" 
modes, whereas "idler" modes guarantee resonance conditions and phase-matching. All 
modes different from the signal mode (i.e. the "system") act globally as a reservoir, and 
the signal mode undergoes a non unitary irreversible transformation (the only exception is 
the ideal phase sensitive amplifier, where the transformation is unitary). On the other hand, 
in the presence of a loss (a beam splitter or a lossy cavity/fiber) the signal mode is gradually 
lost into external modes. The dynamical evolution of the density matrix p of an open system 
is obtained by partially tracing over all the external degrees of freedom (globally referred to 
as the "probe" ) and is described by a map of the form 

p t = E t {p) = Trp[u t pp®pU}] , (22) 

where Ut is a unitary operator and pp is the density matrix for the probe. By expanding the 
trace over a (numerable) orthonormal set, Eq. (p2|) rewrites as follows 



Et{ P ) = £ V k pVi , V kVk = 1 , (23) 

h k 

which is the Stinespring representation |TJ] of a normal trace-preserving CP-map. The 
infinitesimal version of Eq. ( |22"D or ( ]23| ) is usually referred to as the "master equation". 
Lindblad proved that the most general form of a valid master equation is given by|l3| 

d tPt = J2v[A k ]p t , V[A]p = ApA^ - \ {A^A,p\ , (24) 
k z 

where {, } denotes the anticommutator, A k are generic complex operators, and T>[A] are 
called Lindblad super-operators (a "superoperator" is a map that acts on operators both on 
the left and on the right sides). 

The master equation can be transformed into a differential equation for an ordinary 
unknown c-number function (instead of an operator p) using the Wigner representation of 
the density matrix. When the order of derivatives can be truncated at the second one, the 
equation has the form of a Fokker Planck equation (FPE) 



d t W s (a,a*;t) 



d a a Q s (a,a*) + d a *a*Q*(a,a*) + ^V Q : D s (a,a*) 



W s (a,a*;t), (25) 
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where (Q s ,Qt) is the drift vector, D s the diffusion matrix, V Q is the matrix of second 
derivatives 



V, 



if 



if 



r) 2 r) 2 



(26) 



and W s (a,a*) is the generalized Wigner function of the radiation mode with annihilation 
operator a, defined as follows 



W 8 (a,a*) = / X s (A,A*)exp(aA* -a* A) . 

J vr 



(27) 



The Wigner function W s (a,a*) is the Fourier transform of the generating function of the 
s-ordered moments 



X a (X, A*) = Tr [pexp (Aa f - A* a)] exp Qs|A| 2 ^) 



with the moments given by 



Tr 



if 



dX n d(-X* 



X S (X,X*) 



A=0 



d 2 



a 



7T 



W s (a,a*)a* n a m . 



(28) 



(29) 



In the following sections we will only consider the case s = 0, which corresponds to 
symmetrical ordering. Here we notice that the functions W s (a,a*) are not generally 
probability distributions, as they are not positive definite, and for this reason they are named 
"quasi-probabilities" . Only for s < — 1 one has W s (a,a*) > for all a. In particular for 
s = — 1, which corresponds to antinormal ordering, W s (a,a*) is the probability distribution 
of the ideal measurement of the complex field (heterodyne detection) , whereas for s < — 1 it 
represents the same measurement with quantum efficiency r/het = 2/(1 — s) < 1. 

From the Wigner generalized function one can obtain the homodyne detection probability 
distribution in form of the marginal integration 



/dy 
— W s (x + iy,x-iy) , 
IT 



(30) 



and now the quantum efficiency r)hom of the homodyne detector is related to the ordering 
parameters by 7] hom = 1/(1 - s). 

In the rest of the present section we will consider a FPE with constant drift and diffusion 
coefficients of the form 



dtW s (a, a* 



Q(d a a + d a *a*) + 2D s d*a* 



W,(a,a*). 



(31) 



Eq. (|3l]) can be solved analytically, with solution in form of the following Gaussian 
convolution 



W s (a,a*;t) 



1 



irA 2 (t) 



d (3 exp 



| a — (3e 



-0*1 2 



W S (P,(3*;0), 



where 



A2(t) = ^(l-e-«* 



)• 



(32) 



(33) 



For homodyne detection the marginal integration of the FPE leads to the Ornstein-Uhlenbeck 
equation (OUE) 



d t P s (x;t) 



1 



Qd x x + -D s d 2 x 



P s (x;t) , 



(34) 
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again with solution in form of the Gaussian convolution 



P s (x;t) 



J dx' exp 



(x 



x e 



-Qt\2 



2d*(t) 



P a {x';0). 



where 



-2Qt\ 



(35) 



(36) 



Notice that the diffusion terms in (|3l| ) and ( 



generally depend on the efficiency of the 
that the gain for the 



detection apparatus. It is straightforward to see from Eqs.(32) and §i 
corresponding variable (the field amplitude for Eq. ([H]) and the quadrature for Eq. (|34|)) 
is given by G = e _< ^*, and does not depend on the input state. Therefore, the processes 
described by Eqs. (31) and (|34|) correspond to the case of linear amplification for Q < 0, 
or to the case of linear loss for Q > 0. When the diffusion coefficient in Eqs. (^) and 
(|34]) vanishes, the evolution of the corresponding probability distribution is just given by the 
rescaling of the field variable a, namely 



W s (a,a*;t) = -^W s (G~ 1 a, G _1 a*; 0) 
and, for the OUE, for the homodyne variable x 

P s (x;t) = ^P s (G~ 1 X ;0). 



(37) 



(38) 



In this case, for perfect detection (described either by W_\ or by Pq) the device is ideal. For 
detection with r\ < 1, one can even have D s < p| , which means that the ideal amplifier 
can improve the SNR and the mutual information. For heterodyne detection (W-i), the 
ideal amplifier is the so called phase insensitive amplifier (PIA), which will be described in 
detail in Sec. [|. For homodyne detection (Pq) the ideal amplifier is the phase sensitive 
amplifier (PSA), which is described by a Ornstein Uhlenbeck equation of the form (|34|) with 
vanishing diffusion term. For direct detection, namely detection of the photon number of 
the radiation field, the ideal amplifier is called "photon number amplifier" (PNA) [Q. In the 
following section, we will consider these linear devices inserted in a linear Gaussian channel 
with Gaussian input probability. 



4 The linear Gaussian channel 



A linear Gaussian channel is a particular type of channel where the evolution equation is given 
by Eq. ( p5|) or Eq. (34), i.e. it has a Gaussian Green function, and the encoded states at 
the input are themselves described by Gaussian probability distributions. The time-evolved 
probability distributions are easily obtained from Eqs. (|32|) and (|35|) as follows 



W s (a,a*;t) 



1 



vrA2(t) 



exp 



a 



-Qt\2 



A2(t) 



(39) 



and 

P s (x;t) 



exp 



A 2(t) = ^(1 - e - 2 ^) + A 2 s (0)e- 2Qt 



(x - x e Qt ) 2 



(40) 



d 2 (t) = ^(l-e- 2 ^) + d 2 (0)e- 2Qt .m 
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For a Gaussian channel, the probability distribution remains Gaussian at all times, and the 
evolution is completely characterized by the changes of mean value and variance, whereas the 
drift and diffusion coefficients are constant. In this simple case the mutual information and 
the noise figure can be easily computed for different configurations p[ . 

For the Gaussian channel, we now analyze the relations between the three different 
criteria: information, noise, and BER. 



Relation between noise and information 

If we consider a continuous alphabet described by Gaussian states with variance A 2 , and 
transmitted according to an a priori Gaussian probability distribution with variance 5 2 , the 
mutual information takes the form 

/ = ^log 2 (l + ^). (42) 

Hence the mutual information is just a function of the noise N = A 2 and of the variance a 
of the a priori probability distribution. 



Relation between noise and BER 

The use of the BER actually pertains to the case of a binary channel. For a Gaussian channel, 
the two states "0" and "1" have Gaussian probability distributions at the output centered 
around two different values. The two Gaussians are overlapping, and one discriminates 
between "0" and "1" upon introducing a threshold $, such that one has "0" signal for 
y < -d and "1" for y > i?. For output described by probability densities qo and qi, the 
error probabilities Qo l an d Qi\o are given by 



Qo\i = / dyqx(y) (43) 



Ql\0 = / dy q {y) , (44) 

The BER obviously depends on the threshold i?, which must be optimized in order to obtain 
the minimum value. For Gaussian functions this corresponds to putting -d at the crossing 
point between the graphs of the two output distribution probabilities, and the BER is just the 
overlap area. When the input states have the same variance, the optimal threshold $ = S/2 
is placed exactly in the middle between the mean values of the two distributions. From Eq. 
one has 

-&- s ™ 

where the symbol ~ stands for asymptotic value for large SNR. As the error function <E> is 
monotone, one can see that for binary Gaussian channels the SNR and the BER criteria are 
essentially the same. 

For non Gaussian channels, the equivalence between the three different criteria is generally 
not valid. In fact, we cannot establish a general rule for the time evolution of the shapes 
of the probability distributions. Therefore, there may occur cases where the noise figure is 
small because there is little broadening of the probability, but the BER is high because the 
broadening is not symmetrical around the mean value and the overlap region increases more 
than the area below the external tails of the distributions. As we will see in Sec. ||, this is 
the case of a saturable amplifier. In principle, there might be also opposite situations, where 
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the noise figure gets worse because of a broadening of the probabilities, but at the same 
time the BER improves because the broadening is dominant in the non overlapping region of 
the two probabilities. For non Gaussian channels, therefore, the characterization of a device 
performance in terms of noise figure may be significantly different from the one based on the 
BER analysis, and the latter must then be considered instead. 

5 The phase insensitive amplifier 

A phase insensitive linear amplifier is described by the following master equation 

d tPt = 2 \AD[at\ + BV[a}] p t , (46) 



where the superoperator T> is defined in Eq. (p4j) and A > B. As a consequence of the 
invariance T>\aeT % ^\ = T>[a], the device is phase insensitive, namely it amplifies the field 
independently of the value of its phase. The case B > A describes a linear attenuation 
process. With A and B proportional to the atomic populations of the upper and lower lasing 



levels respectively, Eq. (46) describes an active medium amplifier in the linear regime (i.e. 
far from saturation). On the other hand, for A = £m and B = ^(fh + 1) the same equation 
describes a field mode with photon lifetime damped towards the thermal distribution 
with fh average photons. 

Eq. (46) has the following general CP-map solution (lj] 

m itt rrti „ I exp[— arctan V e rt — l(atf — a^b)] (B > A) , 

Pt = Tr P [U tP ®v U}] , , t = | 4_ arctanh ^^ (a t 6 t _ a b)] \ A> B ][ ^ 

where T/2 = \A — B\, and the second field mode b is an additional mode, called "idler", that 
is initially in the thermal state v with average photons fh = min{j4, — B\ [A and B 

must be non negative, otherwise one would have negative idler photons]. The idler mode 
b is needed for the unitarity requirement of the quantum evolution of the fields pl)| . Ideal 
amplification is achieved when the idler mode b is initially in the vacuum state. In the case 
of finite temperature, when the mode b is initially in the thermal state with a non vanishing 
number of photons fh, the amplifier is not ideal (B ^ 0). 

Let us now study the performance of the PIA for different kinds of detection. Correspond- 
ing to the master equation fl46| ) one can obtain a Fokker-Planck equation of the form ( p5| ) 
with Q = B — A and 2D S = A + B + s(A — B). For ideal heterodyne detection (s = —1) the 
device is ideal for B = (ideal phase insensitive amplifier), because the diffusion coefficient 
vanishes. This corresponds, for example, to the case of complete inversion between the two 
lasing levels in an active medium amplifier. On the contrary, linear attenuation is always non 
ideal for heterodyne detection. The Ornstein Uhlenbeck equation has the form (|34j), and for 
ideal homodyne detection (s = 0) the PIA is never ideal. To study ideal direct detection, 
where the photon number operator is measured, we need the evolution of the number-state 
probability. From Eq. ( p7|) for A > B, and for input number state \m)(m\, we have 

f (e 2rt -l)"-" n \ r ^ 

P t (n) = (n\pt\n) = I e ^+i)rt ( n _ m)!m! Iorn - m , (48) 

I for n < m 

whereas in the case of input coherent state |a)(a| we have 

Rfn) = V^ — ^ - e-l Q l lal 2 ( n ^ (49) 

ha e 2(n+m h\[{n-hW e M ' 1 yj 

These results show that the PIA is not ideal for direct detection, since the output probability 
distribution is not simply rescaled. This is the cause of the unavoidable noise of quantum 
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nature that the PIA introduces for direct detection. Moreover, from Eq. fl4q), we see that 
for input number states a non-vanishing BER is developed during the time evolution starting 
from initial BER B = 0. Figure [l] illustrates the mechanism of BER occurrence for coherent 
state input. 




Figure 1: The picture shows the BER (shaded surface) at the output of a PIA. The left histogram 
is the output distribution corresponding to input bit "0" (i.e. the vacuum); the right histogram is 
the output distribution when the input is bit "1" (a coherent state, in this case). The vertical line 
indicates the value of the threshold d that optimizes the BER. 

We'll now analyze in more detail the role of the added noise. The output mode of the 
electromagnetic field is 

a ou t = U^dinU, (50) 

where U is defined in Eq. (|47|). The linear transformation between the fields is given by 

a out = G 1 ' 2 a m + (G-l) 1 ' 2 b\ n . (51) 

Thus, in the case of direct detection, the output average number of photons of the PIA is 

((ataUt) = G{{o)a) in ) + (G — l)(((tfb) in ) + 1), (52) 

where G = e rt is the amplifier gain. The idler mode b is responsible for a non vacuum output 
when there is no input (spontaneous emission), i.e. with the mode cij n in the vacuum state. 



The noise figure is obtained from the output noise defined in Eq. (21). For binary channels 
and under the condition of vacuum state for the bit "0" , the output noise is [||| 



Nout = -[G(n a ) hitl + 2(G-l)(n b + l)+2G(G-l)(n b + l)(n a ) hitl + 

+ 2{G - l) 2 (2n b + 1) + G 2 (n a ) bit x {F a - 1) + 2(G - lf(n b )(T b - 1)+ 
+ 2G(G-l)[(b^)(a^) + (b 2 )(a 2 ) 



(53) 
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where n a = a^a, = b^b and T is the Fano factor of the distribution, i.e. 

(54) 

(n) 

The various contributions to the noise are usually referred to as [following the same order 
in Eq. (|53|)]: (i) quantum fluctuations of the amplified signal; (ii) amplified parametric 
spontaneous emission; (iii) quantum beat between the first and the second terms; (iv) self- 
beat of parametric spontaneous emission; (v) excess noise of the signal and (vi) of the idler; 
(vii) coherent terms. It is easy to see that for strong input signals (n a ) 3> 1, high gain G S> 1, 
and vacuum idler state, the noise figure is 



2(n tt ) 
(Anl 



R^^Bk- (55) 



For coherent input states, the Fano factor is J- = 1, and the noise figure is 

R ~ 2 ~ 3 dB. (56) 

This is the minimal noise figure that can be achieved by a phase insensitive quantum linear 
amplifier for coherent input states and under the above assumptions: it is usually referred to 
as Standard Quantum Limit (SQL). As we have shown, and as shown in Ref. §3], this limit 
is not unsurpassable, but it is just a peculiarity of the linear character of the PIA. 



6 The saturable laser amplifier 

As we have seen in the previous section, the PIA is not ideal for direct detection. In this 
case the ideal amplifier is the PNA proposed by Yuen |Q. Even though the Hamiltonian for 
the PNA has been derived (see Refs. |L8j and pl[), its practical realization is unknown, and 
a way to approach such an ideal device by concrete amplifiers still remains an open problem. 
In this section we investigate the possibility of approximating the behavior of the PNA by 
means of a laser amplifier working in a non-linear regime. The laser will be studied on the 
basis of the theory of Haake and Lewenstein 0. The underlying master equation describes 
uncorrelated two level atoms interacting with a radiation mode in a cavity. The evolution 
of radiation in the laser is described by means of a FPE that is derived from the master 
equation after adiabatic elimination of the atomic variables. In this section we briefly recall 
the Haake-Lewenstein theory, giving the detailed FPE and a discussion of the validity limits 
of the theory. 



6.1 The master equation. 

The model describes N two level atoms interacting with a single mode of the electromagnetic 
field (dipole interaction in the rotating wave approximation), whereas the non-lasing lossy 
modes and the pump mechanisms are taken into account in form of baths. The complete 
master equation that describes atoms and radiation is given by 

d t R(t) = LR(t), (57) 

where R denotes the joint atom-radiation density matrix, and L is the Liouvillian 

L = L a + L f + L af , (58) 
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where the atomic contribution L a is 

L a = ]T Lj ; Lj = + a )V[a +j ) + - ^[a^] + 5 (tj. - ^ W*j]> (59) 

the field contribution is given by 

L f = ~fn th V[(J] + 7 (n t7l + l)P[o], (60) 

and the interaction term L a j is 

L af R(t) = g[S + a- S^a + ,R(t)]. (61) 

The quantities involved in the above equations have the following meaning: o"o is the 
stationary expectation value for the population inversion a z without field (non-saturated 
inversion); 7^ is the decay rate of the atomic polarization; 711 the decay rate of the population 
inversion; T> is the Lindblad superoperator defined in equation (|24j); <r + , <r_ and a z are the 
Pauli operators 



ff+= (o J)' CT - = (i 0)' a * = (l -1) ; (62) 



a is the field annihilation operator; 7 is the decay rate of the cavity; is the mean number 



of thermal photons in the cavity; £± = Ylj=i °"±j\ 9 is the electrical dipole coupling between 



atoms and radiation. 



6.2 Adiabatic expansion of the master equation 



From the master equation (^7|) one proceeds to eliminate the fast atomic degrees of freedom 
by means of an adiabatic expansion. The Liouvillian is separated into two parts 

L = L + AL, (63) 

where Lq is the zero order "fast" term and AL is the "slow" remaining Liouvillian. The zero 
order Liouvillian Lq drives the fast (atomic) variables toward equilibrium with the slow one 
(the lasing mode) with a characteristic decay time i*. Any initial state R(0) is evolved by 
Lq into a state i?*, after a time f » t„ such that LqR^ = 0. The spirit of the adiabatic 
approximation is to consider the dynamical evolution on a time scale t S> t* ■ Eventually, one 
obtains a second order expansion for the master equation of the slow variables in the form 

d tP (t) = {Tr [ALA + J* dr' AL(e L ° T ' - A Tr )ALA] }p(t), (64) 

where pit) = Tro[i?(i)] is the reduced slow density matrix describing the lasing mode alone, 
Tro denotes the partial trace over the fast (atomic) degrees of freedom, and 

A = J R,(Tr [ J R(0)]) _1 (65) 
is the atomic state that adiabatically follows the field. 
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6.3 The Fokker - Planck equation 



Using the Wigner function representation derived in Subsec. 3.2 the annihilation and creation 
operators become differential operators, and the master equation is written in form of a FPE. 
One introduces the Wigner function 

W(a,a*) = J ^e aA *- a * A Tr / [i?(t)e Aa+ - Va ], (66) 

where Tif denotes the partial trace over the field and hence W is still an atomic operator. 
In the Wigner representation the Liouvillian rewrites 

L f W= ^{d a a + 8 a *a* + (2n th + l)d 2 aa *}W (67) 



L af W = g 



aS+ - a*S-,W 



+ |5« (wS- + 5_ w) + 9 -d a * (s+W + WS + ^j , (68) 



whereas L a is unchanged. In the following we will consider zero thermal photons (nth = 0) 
in the lasing mode. 

We are now ready to perform the adiabatic expansion of the master equation described 



in Sec. and the Wigner representation (pa) makes this task much easier. We start from 
separating the total Liouvillian L = L a + Lf + L a f into the two parts Lq and AL in Eq. ( p3| ) . 
Haake and Lewenstein's procedure considers 

L W = L a + g[aS+ - a*5_, W] (69) 

AL = L f + 9 -d a (WS- + S-W^j + 9 -d a * (^S+W + WS+^j . (70) 

The expansion term AL contains the only parts of the Liouvillian that are responsible for 
the radiation evolution. 

Two hypotheses are needed to assure the validity of the theory. The photon lifetime in 
the cavity must be longer than the atomic decay rates 

7>7||,7-L- (71) 

Such condition is consistent with the adiabatic expansion, since the atoms must have a 
characteristic time slower than the radiation one. Moreover, the time scale at which we 
consider the solution of the FPE must be bigger than the time t* at which the atomic 
variables are traced out, namely 

t»t„~7f 1 ,7l 1 . (72) 

This condition guarantees that the atomic variables have reached the equilibrium with the 
field variables. 

Instead of the field variable a it is convenient to use the variable u = aj^fn~ s = 2ag/^y\ff±, 
where the field is rescaled by the number of saturating photons n s = -^pr- 



With the procedure outlined above, in Ref.0 the following FPE is obtained 

d t W(u, u*) = CW(u, u*), (73) 
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where now W is the usual c-number Wigner function of the lasing mode and the Liouvillian 
is the differential operator 



C — u u Q u + u *u Q u + UU D UU -\- d u * u * D uu + 20 UU *D UU *. 
The drift coefficients and the diffusion matrix in equation (|74|) are given by 



(74) 



Qu 



+ 
+ 



7 



1 



2a C 



+ 



2 [ 1 + |u|2 2n s ( 1 + |it| 2 ) 3 
4Cf 



[(i + /)H 2 -/] 



n s (l + |u| 2 ) 4 
C 

2n s (l + |u| 2 ) 4 



N(l - \u\ 2 ) - 2\u\ 



-2<7 2 iV|u| 2 + o- 2 (l - \u\ 2 ) + (3 + |u| 2 )(l + |u 



2\2 



(75) 



D„, 



Cju 2 



4n a (l + |«| 2 ) 3 



[a 2 (l + 2/) + (l + H 2 ) 2 ] 



(76) 



A, 



i(l + \ u \^(2 + \u\ 2 )-\u\ 2 a£(l + 2f)] 



(77) 



with / = 2^ and C = denoting the cooperation parameter of the laser. 
The independent parameters that appear in the Fokker-Planck equation are six: the 
cooperation parameter C, the non-saturated population inversion o"o, the number of lasing 
atoms N, the decay rate of the optical cavity 7, the ratio between the atomic decay rates /, 
the number of saturating photons n s . For the validity of the FPE we remember that these 
parameters must satisfy the following conditions: 



1. For the adiabatic approximation one has 711,71. ^> 7> which implies that 



N Nf 

— < 4C, — < 2C. 

n s n s 



(78) 



2. For the trace time i* condition one has t^>t* ~ 7,, , 7_ L 1 , which implies 

1 4n s C 2n s C 
~t ^ N ' fN ' 

where t is the time scale at which we consider the solution of the FPE. 



that 



(79) 



3. For the validity of the adiabatic expansion (— 2 — <C 1) one needs large saturation 



numbers 



7||7x 



n s > 



(80) 



With the above theoretical description of the laser we can now study the performance of a 
laser amplifier used as a traveling wave optical amplifier (TWOA), where both cavity mirrors 
are semitransparent with high transmissivity: one mirror lets the input radiation enter the 
cavity, while the other lets the amplified radiation exit. In Subsec. [7^ we will describe the 
TWOA by the laser FPE (|73|), with the initial Wigner function corresponding to the input 
state of radiation. 
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7 Monte Carlo simulations: numerical results 



We have studied the laser FPE equation (|73| ) numerically using the Monte Carlo simulation 
method originally proposed in Ref . [jl9| . In this section we present the numerical results from 
simulations and compare the laser amplifier with the ideal PIA in terms of noise and bit error 
rate. 



7.1 Some tests of the theory and of the simulation method 

Ref. [jl9| includes extensive checks of the Monte Carlo simulation method on the basis 
of analytical models. Additional checks to tune the numerical parameters (such as the 
integration time step) for a specific model can be performed by comparing the simulation 
results for stationary quantities with those from the method of the pseudopotential. Also, 
very easily, one can check results in the linear regime against the PIA. Regarding the validity 
limits of the FPE ( |73|) and the underlying adiabatic approximation, a careful test is in order, 
also in consideration of alternative adiabatic approximations that lead to different Fokker- 
Planck equations, as, for example, in the theory by Lugiato, Casagrande and Pizzuto |15[j 
that was considered in Ref. M . In the derivation of Ref. ]l5| , differently from Ref. |tJ , the 
zeroth order part of the Liouvillian Lq contains L a and Lf, whereas AL = L a f corresponds 
to a perturbation expansion in g, rather than an adiabatic approximation. As a result, the 
FPE has the same diffusion matrix, but the drift lacks the second order contribution in 
the adiabatic expansion parameters. A test of the goodness of the FPE can be done by 
comparing results versus those from the solution of the master equation (^) without the 
adiabatic approximation, evaluating the partial trace over the atomic Hilbert space after the 
time in Eq. ([T2]). The master equation, in turn, can be solved using the Quantum Jump 
(QJ) method \L7^ , and in this way one has a completely alternative simulation for the 
laser and a very stringent test for the FPE (|73|). 

In Fig. [2] we report a sample test with the detailed number probability distribution for 
the stationary state from the QJ method for a one-atom laser versus the results of simulations 
of the FPE of both Refs. and |l5| . One can see that in this regime the FPE of Ref. 
perfectly reproduces results from the QJ, whereas there are discrepancies with results from 
the FPE of Ref. [Q. We have analyzed a wide range of parameters, and found always perfect 
agreement for the FPE (|73|), within the validity limits given in Subsec. |6.3| . 



7.2 Comparison between linear and saturable amplifiers 

In this subsection we compare the ideal (i.e. with vacuum idler) PIA and the laser saturable 
amplifier described by the FPE (|73|). The amplifier is inserted in a binary communication 
channel, with the "0" bit encoded on the vacuum state |0) and the "1" bit encoded on a fixed 
coherent state \a). We analyze the gain G, the noise figure R and the bit error rate B. In 
the linear regime the laser is equivalent to a PIA with gain 

G = exp[2 7 t(l - 2a C)) , (81) 

but with a non vacuum idler mode with number of photons 

min{C(l + <ro),C(l-<ro) + l} 
Ub = \2Ca - 1| ' (82) 

From Eq. ( |8~ID one can see that the laser amplifies the radiation if the threshold condition 
Co > 1/2C is satisfied. The idler photons produce additional noise (see Eq. (^3|)) and the 
ideal PIA has always a better noise figure than the laser. Correspondingly, from numerical 
simulations we found that the ideal PIA also performs better than the laser in terms of BER. 
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Figure 2: The output photon number distribution for a laser. The histogram with the error bars 
comes from a Quantum Jump simulation of a one atom laser. The other two histograms are obtained 
from the theories of Haake et al. jjj (histogram on the left) and Lugiato et al. |t5| (histogram on the 
right). Note the agreement of the theory of Haake et al. with the numerical solution of the master 
equation. The laser parameters used in the simulation are C — 30; / = 1; n s — 15; N = 1; Co — 
0.05; 7 i = oo. 

In the saturation regime, on the other hand, the noise figure of the laser can be much smaller 
than the 3-dB standard noise figure of the PIA, as assessed in Refs. p[,po[. However, in such 
a regime, the noise figure R is not the significant criterion of goodness to be considered, as 
explained in Sec. |3] and pointed out in Refs. and p. As a matter of fact, the numerical 
analysis shows that the BER for the laser is always worse than (or, at best, very close to) that 
of the PIA, even if the noise figure for the laser is very small. The mechanism leading to such 
discrepancy between noise figure and BER criteria is illustrated in Fig. ^, where the output 
probability distribution of the two amplifiers is plotted in a regime of strong saturation for 
the laser. 

The saturated laser tends to cut off the right tail of the output distributions for high 
photon numbers. On the other hand, the ideal PIA, being a linear device, evolves both tails 
of the distribution in the same way. If we analyze the two amplifiers at the same gain, the left 
tail of the bit "1" distribution will be higher for the saturable laser than for the ideal PIA, 
since saturation needs a higher contribution from the left tail in order to achieve the same 
gain, and the left tail is less influenced by saturation than the right one. For the same reason 
the bit "0" distribution at the output of the laser turns out to be much more broadened than 
for the PIA. As a consequence the overlap region of the two ("0" and "1") distributions - i.e. 
the bit error rate - is higher in the saturable laser than in the PIA. 



In this paper we have shown that, for direct detection, saturation effects in a traveling wave 
laser amplifier cannot yield BER better than those for a PIA, although they greatly reduce 



8 Conclusions. 
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Fi gure 3: Comparison between the output photon number distributions p n of the saturable 
amplifier (left figure) and ideal PIA (right figure). The parameters used in this simulation are 
C = 4.5; n s = N = 55; <jq = 1; "ft = 0.2; |a| = 3.95 (amplitude of the input coherent state 
pertaining bit "1"). This amplifier has a gain of 6.873 dB. 

the noise figure greatly below the SQL of 3 dB. In fact, the saturation mechanism broadens 
the output probability distributions asymmetrically, in such a way that the noise figure is 
improved, whereas the overlap between the distributions, which gives the BER, is increased. 
This mechanism has been shown on the basis of a careful Monte Carlo simulation of a traveling 
wave laser amplifier. The question now is: is it possible to improve the BER of the PIA in 
some way? The answer is clearly affirmative, if one just considers that at least the ideal PNA 
could in principle achieve zero BER when amplifying input number states, or has B = e - l"l 
independent of the gain G for input bit "1" in the coherent state \a), a value anyway much 
smaller than the corresponding BER for the PIA. Now the problem is how to achieve or 
to approach a PNA by a real device, and from the present analysis we conclude that the 
saturated laser does not approximate a PNA better than a PIA, even though it has a very 
low noise figure. 
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